
Conversation

@tjhunter (Collaborator)

Description

./packages/evaluate/src/weathergen/evaluate/run_evaluation.py

Issue Number

Refs #1092

Is this PR a draft? Mark it as draft.

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the Mattermost channels and/or a design doc
    • for changes of dependencies: the Mattermost software development channel

readme = "../../README.md"
requires-python = ">=3.12,<3.13"
dependencies = [
    "mlflow",
@tjhunter (Collaborator, Author):

maybe mlflow-skinny would work
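For context, a minimal sketch of that swap in the pyproject.toml excerpt above (an assumption, not the final change: mlflow-skinny omits the UI, server, and other heavy extras, so it only suffices if the evaluation code sticks to the tracking-client APIs):

```toml
dependencies = [
    # lighter, client-only MLflow distribution; enough for logging runs and metrics
    "mlflow-skinny",
]
```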

@grassesi grassesi closed this Oct 27, 2025
@grassesi grassesi deleted the tjh/dev/1092_mlflow branch October 27, 2025 13:47
@grassesi grassesi restored the tjh/dev/1092_mlflow branch October 27, 2025 16:37
@tjhunter tjhunter reopened this Oct 29, 2025
channels_set = collect_channels(scores_dict, metric, region, runs)

for run_id, metrics_dict in reordered_dict.items():
    parent_run = get_or_create_mlflow_parent_run(mlflow_client, run_id)
@tjhunter (Collaborator, Author):

The problem is the run_id here, which is the inference run. The easiest fix is probably to query MLflow for the value of the tag tags.from_run_id (which will be the model run).

Alternatively, we could read the config for this run; I think that would be more complicated.
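For illustration, a minimal sketch of that tag lookup (assuming the inference run can be fetched by its MLflow run ID; model_run_id_from_tags is a hypothetical helper, not code from this PR):

```python
from mlflow.tracking import MlflowClient

client = MlflowClient()

def model_run_id_from_tags(inference_run_id: str) -> str | None:
    """Return the model run behind an inference run, read from its MLflow tags.

    Assumes the inference run carries a ``from_run_id`` tag, as suggested above.
    """
    run = client.get_run(inference_run_id)
    return run.data.tags.get("from_run_id")
```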

Contributor:

I pushed an update, getting from_run_id from the run config and using it to generate parent_run.
Upon uploading the scores, the ordering in MLflow looks as below:

[screenshot: MLflow run ordering after the update]

We still have to check that everything else is in order in terms of the relations between run_ids.
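For reference, a sketch of how that config-based lookup might look; the config location, format, and field name below are assumptions for illustration, not the actual implementation:

```python
from pathlib import Path

import yaml

def from_run_id_of(run_id: str, configs_dir: Path) -> str:
    """Hypothetical helper: read an inference run's config and return the
    model run it was derived from (its ``from_run_id`` entry)."""
    cfg = yaml.safe_load((configs_dir / f"{run_id}.yaml").read_text())
    return cfg["from_run_id"]
```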

Contributor:

Not sure if we should keep the inference run_id in the name or rather put it in the metadata. On the other hand, it might be good to keep it in the name in case there are several inference instances...
